Learning to Communicate with Deep Multi-Agent Reinforcement Learning
We consider the problem of multiple agents sensing and acting in environments
with the goal of maximising their shared utility. In these environments, agents
must learn communication protocols in order to share information that is needed
to solve the tasks. By embracing deep neural networks, we are able to
demonstrate end-to-end learning of protocols in complex environments inspired
by communication riddles and multi-agent computer vision problems with partial
observability. We propose two approaches for learning in these domains:
Reinforced Inter-Agent Learning (RIAL) and Differentiable Inter-Agent Learning
(DIAL). The former uses deep Q-learning, while the latter exploits the fact
that, during learning, agents can backpropagate error derivatives through
(noisy) communication channels. Hence, this approach uses centralised learning
but decentralised execution. Our experiments introduce new environments for
studying the learning of communication protocols and present a set of
engineering innovations that are essential for success in these domains.
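
The differentiable channel in DIAL can be pictured as a discretise/regularise unit: during centralised learning the real-valued message is perturbed with channel noise and squashed, so the receiver's error derivatives flow back to the sender; at decentralised execution the message is hard-discretised. A minimal PyTorch sketch under those assumptions (the noise level sigma and the zero threshold are illustrative, not necessarily the paper's exact settings):

    import torch

    def dru(message: torch.Tensor, training: bool, sigma: float = 2.0) -> torch.Tensor:
        # Discretise/regularise unit: noisy but differentiable during
        # centralised learning, hard-discretised at decentralised execution.
        if training:
            # Gaussian channel noise keeps the message real-valued, so the
            # receiving agent's error derivatives reach the sending agent.
            return torch.sigmoid(message + sigma * torch.randn_like(message))
        # At execution the channel carries one discrete bit per message unit.
        return (message > 0.0).float()

    # The sender emits a real-valued message; gradients flow through the channel.
    msg = torch.randn(4, 1, requires_grad=True)  # batch of four 1-bit messages
    dru(msg, training=True).sum().backward()
    assert msg.grad is not None  # the sender receives a learning signal
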
DualLip: A System for Joint Lip Reading and Generation
Lip reading aims to recognize text from talking lips, while lip generation
aims to synthesize talking lips according to text, which is a key component in
talking face generation and is a dual task of lip reading. In this paper, we
develop DualLip, a system that jointly improves lip reading and generation by
leveraging the task duality and using unlabeled text and lip video data. The
key ideas of DualLip are: 1) Generate lip video from unlabeled text
with a lip generation model, and use the pseudo pairs to improve lip reading;
2) Generate text from unlabeled lip video with a lip reading model, and use the
pseudo pairs to improve lip generation. We further extend DualLip to talking
face generation with two additionally introduced components: lip to face
generation and text to speech generation. Experiments on GRID and TCD-TIMIT
demonstrate the effectiveness of DualLip on improving lip reading, lip
generation, and talking face generation by utilizing unlabeled data.
Specifically, the lip generation model in our DualLip system trained with
only 10% of the paired data surpasses the performance of the model trained with
the whole paired data. And on the GRID lip reading benchmark, we achieve a
1.16% character error rate and a 2.71% word error rate, outperforming the
state-of-the-art models using the same amount of paired data.

Comment: Accepted by ACM Multimedia 2020
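
The two pseudo-pair passes can be written as one training step. A minimal sketch, assuming hypothetical lip_reader and lip_generator callables and generic ctc_loss / recon_loss objectives (all names are placeholders, not the DualLip API):

    def dual_step(unlabeled_text, unlabeled_video,
                  lip_reader, lip_generator, ctc_loss, recon_loss):
        # 1) Unlabeled text -> pseudo lip video; the (pseudo_video, text)
        #    pair supervises the lip reading model.
        pseudo_video = lip_generator(unlabeled_text)
        loss_reading = ctc_loss(lip_reader(pseudo_video), unlabeled_text)

        # 2) Unlabeled lip video -> pseudo text; the (pseudo_text, video)
        #    pair supervises the lip generation model.
        pseudo_text = lip_reader(unlabeled_video)
        loss_generation = recon_loss(lip_generator(pseudo_text), unlabeled_video)

        # Optimising the sum lets each task's unlabeled data improve the other.
        return loss_reading + loss_generation
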